Document made available under the 
Patent Cooperation Treaty (PCT) 



International application number: PCT/US05/005996 
International filing date: 24 February 2005 (24.02.2005) 



Document type: Certified copy of priority document 

Document details: Country/Office: US 

Number: 60/547,526 

Filing date: 25 February 2004 (25.02.2004) 



Date of receipt at the International Bureau: 07 April 2005 (07.04.2005) 



Remark: Priority document submitted or transmitted to the International Bureau in 
compliance with Rule 17.1(a) or (b) 




World Intellectual Property Organization (WIPO) - Geneva, Switzerland 
Organisation Mondiale de la Propriete Intellectuelle (OMPI) - Geneve, Suisse 



THIS IS TO CERTIFY THAT ANNEXED HERETO IS A TRUE COPY FROM 
THE RECORDS OF THE UNITED STATES PATENT AND TRADEMARK 
OFFICE OF THOSE PAPERS OF THE BELOW IDENTIFIED PATENT 
APPLICATION THAT MET THE REQUIREMENTS TO BE GRANTED A 
FILING DATE. 



APPLICATION NUMBER: 60/547,526 
FILING DATE: February 25, 2004 

RELATED PCT APPLICATION NUMBER: PCT/US05/05996 



t :NI I I 1> STATES DKPARIMEY) OF COMMERCE 



United Sink's Paten! ami Trademark Office 



March 31, 2005 




Under Secretary of Commerce 
for Intellectual Property 
a»u! Director of the United Slat es 
Patent and Trademark Office 



Certified by 




13049 U.S. PTO 



December 19, 1995 

PTO/SB/1601-95) 
Approved for use through 01/31/98. OMB 0651-0037 
demark Office; U.S. DEPARTMENT OF COMMERCE 



PROVISIONAL APPLICA TION FOR PATENT COVER SHEET 







689290-186 


Type a plus sign (+) 


+ 


INVENTORS 


LAST NAME 


FIRST NAME 


MIDDLE INITIAL 


RESIDENCE (CITY AND EITHE 


R STATE OR FOREIGN COUNTRY) 


Strovel 

Bol 

Young 


Jeffrey 
David 
Paul 


w. 


Laurel, MD 
Gaithersburg, MD 
Gaithersburg, MD 










TITLE 








3 $M 




CHEMO-CENTRIC SELECTION OF DISEASE-RELATED GENETIC PROFILES 



CORRESPONDENCE ADDRESS 



OLSTEIN, Elliot M. (Reg. No. 24,025); GILFILLAN, John G. (Reg. No. 22,746); BAIN, John N. (Reg. No. 18,651); 
LILLIE, Raymond J. (Reg. No. 31,778); SQUIRE, William (Reg. No. 25,378; GRANT, Alan J. (Reg. NO. 33,389) ; 

Alan J. Grant, Esq., CARELLA, BYRNE, BAIN, GILFILLAN, CECCHJ, STEWART & OLSTEIN, 6 Becker Farm Road, Roseland, NJ 07068. 



ENCLOSED APPLICATION PARTS (check all that apply) 



| Specification 
| Drawing(s) 



Number of Pages 33 
Number of Sheets 1 



^ Small Entity Statement 
□ Other (specify) 



METHOD OF PAYMENT OF FILING FEES FOR THIS PROVISIONAL APPLICATION FOR PATENT (check one) 



A check or money order is enclosed to cover the riling fees 



The Commissioner is hereby authorized to charge 03-0678 
filing fees and credit Deposit Account Number: 



FILING FEE 

AMOUNT FOR SMALL ENTITY 



The invention was made by an agency of the United States Government or under a contract with an agency of the United States Government. 

^ No. 

I I Yes. The name of the U.S. Government agency and the Government contract number are: 



Respectfully submitted, 
SIGNATURE 
TYPE or PRINTED NAME Man J. Grant, Esq. 
Q Additional ir 



DATE 25 February 2004 



REGISTRATION NO. 



33,389 



rs are being named on separately numbered sheets attached hereto 

USE ONL Y FOR FILING A PROVISIONAL APPLICA TION FOR PA TENT 

t: This form is estimated to take .2 hours to complete. Time will vary depending upon the needs of the individual case. Any comments on the amount of time 
you are required to complete this form should be sent to the Chief Information Officer, Patent and Trademark Office, Washington, D.C. 2023 1 . DO NOT SEND FEES OR 
COMPLETED FORMS TO THIS ADDRESS. SEND TO: Box Provisional Application, Assistant Commissioner for Patents, Washington, DC 20231 . 



#52293 vl - Prov.App.Cvr.Sht 



EXPRESS MAIL NO: EL998936994US 



CHEMO-CENTRIC SELECTION OF DISEASE- 
RELATED GENETIC PROFILES 



1 0 FIELD OF THE INVENTION 



The present invention relates to methods of utilizing biologically active 
compounds for the identification and selection of gene sets having characteristic 
15 expression profiles for an identifiable disease process, including use of 
compound-responsive gene signatures to predict sensitivity and resistance to 
experimental therapeutics. 



BACKGROUND OF THE INVENTION 

Many different agents are known to possess biological activity, including 
therapeutic activity, and for many of these the molecular mechanism of action is 

25 known. Thus, such compounds may be determined to be related to each other in 
that they have a common mechanism of action, which mechanism may bear 
some relationship to the chemical properties of the compounds or to their overall 
molecular shape. Alternatively, such compounds may not be similar in overall 
molecular shape or properties but may still, for diverse reasons, operate 

30 biologically in a similar manner. In addition, such compounds, related by 
mechanism of action (MOA) may also show other properties in common and thus 
these MOA-related sets of compounds may be formed into distinct groups based 
on their common biological activity. 



In a similar way, diverse cells may be related in terms of their susceptibility 
to a given chemical agent, or test compound, which may act by modifying the 
expression profile of a given set of genes within the genome of the cells. Thus, 
an expression profile may be formulated for a given gene set, the latter being 
5 some subset of the genome of the cell, and this expression profile may be 
modulated by the presence of a particular chemical agent. It would be 
advantageous to be able to take advantage of such a relationship based on 
common expression profiles, especially where the given gene set is related to a 
disease process or to the viability of the cell. 

10 

Because methods of analyzing gene expression are subject to use in large 
screening assays, where such methods, including rapid measurement of 
messenger RNA species coupled with methods of reverse transcriptase- 
polymerase chain reaction amplification for ease of measurement, are 
15 susceptible to high degrees of automation, such genetic methods present 
themselves as a ready medium for high throughput screening for agents having 
a selected biological activity. 

Heretofore many expression profiles of diverse gene sets of different cells 
20 have been determined and most are available in public data bases. In addition, 
workers in the disease treatment area have attempted to determine expression 
profiles of cells in different diseases, such as cancer, in the hopes of finding an 
agent that changes the expression profile and thereby serves to alleviate the 
disease condition, such as by killing the cell involved (for example, a cancer cell). 
25 However, one problem with this approach is that one does not know beforehand 
whether a given disease is amenable to treatment with a given therapeutic agent 
and consequently large scale screening processes have been developed, with 
time and expense a major factor. Since many known therapeutic agents are 
already available it would be advantageous if one could ascertain the likelihood 
30 that a given cell would be susceptible to a given agent before embarking on large 
scale screening processes. 
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The present invention solves this problem by taking advantage of such 
methods to provide expression profiles of different cells. In so doing, the likely 
susceptibility of a cell to a selected therapeutic agent can be determined by 
finding cells with the same gene expression profile as a cell known to be 
5 susceptible to said therapeutic agent and where said agent modulates the 
expression profile of the given gene set. 



1 0 BRIEF SUMMARY OF THE INVENTION 

In one aspect, the present invention relates to a method for identifying a 
cell susceptible to a selected chemical agent, comprising: 

(a) identifying a cell susceptible to a test compound wherein said 
15 susceptibility depends on a change in expression profile of a set of genes 

expressed by the susceptible cell and due to said test compound 

(b) identifying a different cell from said susceptible cell wherein said 
different cell expresses the set of gene of (a) with the expression profile of said 
set of genes of said susceptible cell absent the change due to said test 

20 compound, 

wherein said expression profile identifies said different cell as being a cell 
susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
chemical agent. 

25 

In another aspect, the present invention relates to a method for identifying 
a cell susceptible to a selected chemical agent, comprising: 

(a) contacting a test compound with a cell and determining inhibition of 
growth of said cell due to said contacting thereby identifying said cell as a 
30 susceptible cell, 
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(b) determining a change in the expression profile of a set of genes 
expressed by said susceptible cell wherein said change is due to said contacting 
thereby identifying said set of genes as an affected gene set, 

(c) determining the expression profile of said affected gene set of said 
5 susceptible cell in the absence of said contacting and thereby identifying a basal 

expression profile for said affected gene set, 

(d) determining said basal expression profile for said affected gene set for 
a different cell from said contacted cell, 

wherein said basal expression profile for said affected gene set indicates a 
1 0 cell susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
chemical agent. 

In another aspect, the present invention relates to a method for treating a 
1 5 disease in a mammal comprising administering to an mammal afflicted with said 
disease of a therapeutically effective amount of the test compound above 
wherein said disease is caused by a cell having the basal expression profile of 
the above-recited gene set. In a preferred embodiment, the mammal is a human 
patient and said disease is cancer. 

20 

In a separate aspect, the present invention relates to a method of 
identifying a cancer patient likely to respond positively to treatment with a 
selected anti-neoplastic agent comprising evaluating said patient's cancer for the 
presence of a cell equivalent to a different cell as identified by the method of the 
25 invention wherein said anti-neoplastic agent is the test compound of the 
invention. 

In an alternative aspect, the present invention relates to a method of 
identifying a cancer patient unlikely to respond positively to treatment with a 
30 selected anti-neoplastic agent comprising evaluating said patient's cancer for the 
absence of a cell equivalent to a different cell as identified by the method of the 

4 



invention wherein said anti-neoplastic agent is the test compound of the 
invention. 



5 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 shows a compound-centric approach to selection of sensitive and 
1 0 resistant cell lines, yielding a cell line, dubbed colo205, that is as sensitive as the 
original cell line (MCF-7) and 2 lines that are more than 10 fold less sensitive by 
growth inhibition. Thus, the present invention is useful in identifying cells as 
sensitive or more resistant to reference cells. 

15 

DEFINITIONS 

As used herein, unless expressly stated otherwise, the following terms 
20 have the indicated meaning. 

In accordance with the present invention, the term "DNA segment" or "DNA 
sequence" refers to a DNA polymer, in the form of a separate fragment or as a 
component of a larger DNA construct, which has been derived from DNA isolated 

25 at least once in substantially pure form, i.e., free of contaminating endogenous 
materials and in a quantity or concentration enabling identification, manipulation, 
and recovery of the segment and its component nucleotide sequences by 
standard biochemical methods, for example, using a cloning vector. Such 
segments are provided in the form of an open reading frame uninterrupted by 

30 internal non-translated sequences, or introns, which are typically present in 
eukaryotic genes. Sequences of non-translated DNA may be present 
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downstream from the open reading frame, where the same do not interfere with 
manipulation or expression of the coding regions. 

The term "coding region" refers to that portion of a gene which either 
5 naturally or normally codes for the expression product of that gene in its natural 
genomic environment, i.e., the region coding in vivo for the native expression 
product of the gene. The coding region can be from a normal, mutated or altered 
gene, or can even be from a DNA sequence, or gene, wholly synthesized in the 
laboratory using methods well known to those of skill in the art of DNA synthesis. 

10 

In accordance with the present invention, the term "nucleotide sequence" 
refers to a heteropolymer of deoxyribonucleotides. Generally, DNA segments 
encoding the proteins provided by this invention are assembled from cDNA 
fragments and short oligonucleotide linkers, or from a series of oligonucleotides, 
1 5 to provide a synthetic gene which is capable of being expressed in a recombinant 
transcriptional unit comprising regulatory elements derived from a microbial or 
viral operon. 

The term "expression product" means that polypeptide or protein that is the 
20 natural translation product of the gene and any nucleic acid sequence coding 
equivalents resulting from genetic code degeneracy and thus coding for the 
same amino acid(s). 

The term "promoter" means a region of DNA involved in binding of RNA 
25 polymerase to initiate transcription. The term "enhancer" refers to a region of 
DNA that, when present and active, has the effect of increasing expression of a 
different DNA sequence that is being expressed, thereby increasing the amount 
of expression product formed from said different DNA sequence. 
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The term "open reading frame (ORF)" means a series of triplets coding for 
amino acids without any termination codons and is a sequence (potentially) 
translatable into protein. 



5 As used herein, reference to a DNA sequence includes both single stranded 

and double stranded DNA. Thus, the specific sequence, unless the context 
indicates otherwise, refers to the single strand DNA of such sequence, the 
duplex of such sequence with its complement (double stranded DNA) and the 
complement of such sequence. 

10 

The term "percent identity" or "percent identical," when referring to a 
sequence, means that a sequence is compared to a claimed or described 
sequence after alignment of the sequence to be compared (the "Compared 
Sequence") with the described or claimed sequence (the "Reference Sequence"). 
1 5 The Percent Identity is then determined according to the following formula: 

Percent Identity = 100[1-(C/R)] 

wherein C is the number of differences between the Reference Sequence and the 
20 Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in the 
Reference Sequence that does not have a corresponding aligned base or amino 
acid in the Compared Sequence and (ii) each gap in the Reference Sequence and 
(iii) each aligned base or amino acid in the Reference Sequence that is different 
25 from an aligned base or amino acid in the Compared Sequence, constitutes a 
difference; and R is the number of bases or amino acids in the Reference 
Sequence over the length of the alignment with the Compared Sequence with any 
gap created in the Reference Sequence also being counted as a base or amino 
acid. 

30 



7 



If an alignment exists between the Compared Sequence and the Reference 
Sequence for which the percent identity as calculated above is about equal to or 
greater than a specified minimum Percent Identity then the Compared Sequence 
has the specified minimum percent identity to the Reference Sequence even 
5 though alignments may exist in which the hereinabove calculated Percent Identity is 
less than the specified Percent Identity. 

As used herein, the terms "portion," "segment," and "fragment," when used 
in relation to polypeptides, refer to a continuous sequence of residues, such as 

1 0 amino acid residues, which sequence forms a subset of a larger sequence. For 
example, if a polypeptide were subjected to treatment with any of the common 
endopeptidases, such as trypsin or chymotrypsin, the oligopeptides resulting from 
such treatment would represent portions, segments or fragments of the starting 
polypeptide. When used in relation to a polynucleotides, such terms refer to the 

1 5 products produced by treatment of said polynucleotides with any of the common 
endonucleases, or any stretch of polynucleotides that could be synthetically 
synthesized. 

The term "correspond" means that the gene has the indicated nucleotide 
20 sequence or that it encodes substantially the same RNA as would be encoded by 
the indicated sequence, the term "substantially" meaning about at least 90% 
identical as defined elsewhere herein and includes splice variants thereof. 

The term "corresponding genes" refers to genes that encode an RNA that 
25 is at least 90% identical, preferably at least 95% identical, most preferably at 
least 98% identical, and especially identical, to an RNA encoded by one of the 
nucleotide sequences disclosed herein (i.e., SEQ ID NO: 1-12). Such genes will 
also encode the same polypeptide sequence as any of the sequences disclosed 
herein, preferably SEQ ID NO: 1-12, but may include differences in such amino 
30 acid sequences where such differences are limited to conservative amino acid 
substitutions, such as where the same overall three dimensional structure, and 
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thus the same antigenic character, is maintained. Thus, amino acid sequences 
may be within the scope of the present invention where they react with the same 
antibodies that react with polypeptides comprising the sequences of SEQ ID NO: 
1-12 as disclosed herein. 

5 

The term "related gene set" refers to a set of genes, perhaps 5, 10 or 
more genes, such as those corresponding to the sequences disclosed herein, 
whose pattern of expression in a cell, expression is modulated by a given set of 
biologically active agents, especially where said agents exert said activity by a 
1 0 common molecular mechanism. 

As used herein, the terms "gene expression profile" or "gene expression 
fingerprint" are interchangeable and refer to the pattern of gene expression 
modulation, including increase or decrease of expression, exhibited by an the 

1 5 members of a set of chemical agents with established biological activity when 
determined using a related gene set. Thus, for a set of 10 genes, possibly genes 
1-6 are reduced in expression and genes 7-10 are increased in expression after 
contact with each of a set of agents having common biological activity. These 
genes represent a related gene set. The profile or fingerprint will include the 

20 relative degree of increase or decrease of expression of the genes of the set in 
response to the presence of a given concentration of an established biologically 
active agent (for example, expression of gene 1 may be reduced by half, gene 2 
by 2/3, gene 3 not expressed at all, gene 7 doubled in expression, gene 10 
increased 3 fold in expression, and so on in response to each of the compounds 

25 of the set and relative to the steady state levels of said genes). In the typical 
case, compound A is introduced into the growth medium of the cells. The result is 
a gene expression profile, or gene expression fingerprint, or expression 
fingerprint, for compound A and other compounds of the set possessing common 
biological activity. 

30 
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As used herein, the term "test compound" refers to a chemical compound, 
including small organic compounds or even larger structures, such as proteins or 
anti-sense agents, that are used to determine the susceptibility of the growth 
and/or viability of a cell following contact with the test compound and where the 
5 test compound is known to modulate, or change, either by decreasing or 
increasing expression, the expression profile of an identifiable set of genes within 
the genome of the cell susceptible to said test compound. 

"Basal gene expression" refers to the expression of a gene, or set of 
1 0 genes, when said genes, or a cell containing said genes, is not in contact with a 
test compound. Such expression may be measured by determining amount or 
rate of synthesis of RNA or protein (i.e., by transcription or translation) of by 
determining the level of enzyme activity of enzymes encoded by one or more of 
the genes of a gene set. 

15 

"Affected gene set" refers to a set of genes, being a subset of the genome 
of a cell, whose expression profile is altered by contacting the gene set, or a cell 
expressing the gene set, with a test compound where the cell is susceptible to 
the test compound and said susceptibility is related to, or caused by, a change in 

20 the expression profile of the gene set. When the cell is not contacted with said 
test compound the expression profile of the affected gene set would be deemed 
the basal expression profile. A gene set present in a cell different from the 
susceptible cell and not in contact with a test compound would be deemed an 
affected gene set if said gene set is made up of genes each of which 

25 corresponds to a gene of an affected gene set of a susceptible cell in contact 
with a test compound. 

"Susceptible cell" refers to a cell whose growth is modulated or altered, 
especially decreased, after contact with a test compound and where said 
30 modulation is related to, or dependent upon, a change in the expression profile of 
an affected gene set. Such modulation or alteration of cell growth may be 
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manifested as a cessation of cell multiplication, a decrease in metabolic activity 
of the cell or even death of the cell. 

As used herein, any algorithm that deems a test sample to share common 
5 gene expression patterns or gene expression copy numbers with that of the 
original sensitive cell line defines similarity. These methods can be comprised of 
clustering methods such as heirarchichal, principal component analysis, K- 
means, or profile matching (using pearson or spearman scores, etc). 

10 

DETAILED SUMMARY OF THE INVENTION 

The present invention discloses a method by which treatment-induced 
15 gene expression signatures are used to classify cancer indications, tumor 
models, and eventually patient populations that would be either sensitive or 
resistant to treatment with a class of compounds. This process could dramatically 
speed up drug discovery and lead optimization by directing a screening program 
to the most sensitive indications and tumor models for which the compounds are 
20 most suited. Presently, this is accomplished by determining the MOA or target of 
a compound series and then assaying tumor samples for the presence of such 
target in the hope that presence of the target correlates with sensitivity to the 
compound. 

25 When the target of a compound is not known, however, basal gene 

expression levels from groups of sensitive and resistant samples would be 
compared to find genes or signatures associated with sensitivity to the compound 
series. This process is time consuming and expensive. The present invention 
solves this problem by offering a method wherein one does not need to know the 

30 MOA of a compound series or to compile samples proven to be sensitive or 
resistant to treatment. Thus, application of the methods of the present invention 



11 



saves both time and money and requires only determining a treatment-induced 
genetic signature. The latter is then compared with basal gene expression in 
tumor samples to find other tumor indications, tumor models, and patient 
populations that will likely be amenable to treatment with the compound of 
5 interest. 

The methods of the invention thus describe a compound-centric approach 
for the detection of cell lines and clinical samples that are sensitive to treatment 
with a compound class. The method comprises treating sensitive cell lines with a 

1 0 compound to determine a gene signature related to the action of the compound. 
This gene signature is screened (for example, virtually, such as by relying on 
data analysis software on the entire GX2000 database of cell lines and clinical 
samples) on other cell lines and clinical samples to find samples with a similar 
(note below) basal gene signature to that of the original cell line's resting 

1 5 signature for those genes. Cell lines and samples that contain similar gene 
expression patterns to the original sensitive cell line can be tested for sensitivity 
to the compound family (see Figure 1). 

Thus, the present invention provides a method of using the gene signature 
20 induced by compounds to find a sensitive, and thus amenable to treatment, 
population of cells, tumors, or patients. 

The methods of the present invention find use in screening gene sets to 
find compounds that will exhibit a specific signature related to cancer cell 

25 sensitivity. They also find use as PD markers (i.e., a therapeutic set used to 
monitor chemotherapy for a particular compound class and to set the dosing 
requirements of compounds by monitoring the gene expression pattern in 
lymphocytes or other easily obtained tissue) as well as in patient population 
stratification into those likely to be responders versus non-responders. Use of the 

30 methods of the invention permits lead optimization programs will be faster and 
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more efficient because the proper indications for a compound class as well as 
the proper tumor models for in vivo studies can be picked up much earlier in the 
process, especially in cases where the target of the compound is unknown. 



5 Thus, the present invention relates to a method for identifying a cell 

susceptible to a selected chemical agent, comprising: 

(a) identifying a cell susceptible to a test compound wherein said 
susceptibility depends on a change in expression profile of a set of genes 
expressed by the susceptible cell and due to said test compound 
10 (b) identifying a different cell from said susceptible cell wherein said 

different cell expresses the set of gene of (a) with the expression profile of said 
set of genes of said susceptible cell absent the change due to said test 
compound, 

wherein said expression profile identifies said different cell as being a cell 
1 5 susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
chemical agent. 

In accordance with the present invention, an expression profile of an 
20 affected gene set might be determined as follows, although other means certainly 
present themselves to those skilled in the art. Model cellular systems using cell 
lines, primary cells, or tissue samples are maintained in growth medium and may 
be treated with compounds at a single concentration or at a range of 
concentrations. At specific times after treatment, cellular RNAs are isolated from 
25 the treated cells, primary cells or tissues, which RNAs are indicative of 
expression of the different genes. The cellular RNA is then divided and subjected 
to analysis that detects the presence and/or quantity of specific RNA transcripts, 
which transcripts may then be amplified for detection purposes using standard 
methodologies, such as, for example, reverse transcriptase polymerase chain 
30 reaction (RT-PCR), etc. The presence or absence, or levels, of specific RNA 
transcripts are determined from these measurements and a metric derived for the 
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type and degree of response of the sample versus the steady state levels of such 
transcripts when the compound is not present. The relative levels of RNA 
transcripts following said contacting with each of a set of agents having 
established biological activity, including therapeutic activity, such as anti- 
5 neoplastic activity, and/or enzyme inhibitory activity and the like serves to define 
a related gene set and the expression profile of this set provides the fingerprint 
for the established biologically active agent. 

The present invention also provides screening assays for identifying 
1 0 biologically active agents, whether the underlying chemical structures are novel 
or otherwise, based on the action of such agents to modulate such gene sets in a 
manner similar to that of an established biologically active agent. 

In one embodiment of the invention, an established biologically active 
1 5 agent, such as an agent found to inhibit the growth or metastasis of, or kill, 
cancerous cells, is used to identify a set of cancer related genes by determining 
the genes present in a cancerous cell whose expression is modulated when said 
cell is contacted with an agent (i.e., a test compound), such as one having 
established biological activity, including therapeutic activity, such as anti- 
20 neoplastic activity, and/or enzyme inhibitory activity and the like. Thus, as a result 
of such contacting, genes whose expression changed versus when said 
contacting does not occur (i.e., the steady state levels of such gene expression), 
are found to show increased or decreased expression, related to the therapeutic 
effect of the test compound, may then be grouped as an affected gene set (in this 
25 case, a cancer-related gene set). 

In a highly specific but non-limiting example, where said biological activity 
is anti-neoplastic activity, an established anti-neoplastic agent, compound A, is 
determined to modulate the expression of 10 genes found in a colon cancer cell, 
30 such as an adenocarcinoma, whereby genes these genes show a varying pattern 
of expression following contacting of the cell with compound A. For example, 
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genes 1 to 7 show reduced expression, or non-expression, while genes 8 to 10 
show expression, or increased expression, as a result of said contacting. This set 
of 10 genes thus represents a cancer related gene set as defined herein. Each of 
said 10 genes may be modulated to a different extent by said established anti- 
5 neoplastic agent. For example, expression of gene 1 may be reduced to a level 
where expression is no longer detected while gene 2 is reduced to half its 
expression when compound A is not present. The relative levels of expression of 
each of the genes in the presence and absence of compound A serves to 
establish an expression pattern, or expression profile. Expression in the absence 
1 0 of contacting with such test compound establishes a basal expression profile. 

In accordance with the invention, once a basal expression profile is known 
for an identified affected gene set other cells and tissues, related or unrelated to 
a susceptible cell, can then be determined by finding a similar basal expression 

1 5 profile for the same gene set in other cells and tissues. One way to do this is 
through the use of databases, including public databases, that provide 
expression levels of identified genes in diverse cells and tissues from varied 
sources and from different species. Thus, the susceptible cell and the affected 
gene set may be identified using a cell from one species, such as a mouse, and 

20 then a similar gene set and basal expression profile may be determined for a cell 
from a different species, such as a human being. 

In carrying out the methods of the invention, it is not essential to determine 
the identity of the test compound, susceptible cell, affected gene set and basal 
25 expression profile de novo as part of the methods of the invention. These may 
already be known and such information may already be available in publicly 
available sources. Once this information is attained, the initial steps of the 
claimed method are deemed to have been carried out. 

30 In one embodiment, the present invention relates to a method for 

identifying a cell susceptible to a selected chemical agent, comprising: 
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(a) contacting a test compound with a cell and determining inhibition of 
growth of said cell due to said contacting thereby identifying said cell as a 
susceptible cell, 

(b) determining a change in the expression profile of a set of genes 
5 expressed by said susceptible cell wherein said change is due to said contacting 

thereby identifying said set of genes as an affected gene set, 

(c) determining the expression profile of said affected gene set of said 
susceptible cell in the absence of said contacting and thereby identifying a basal 
expression profile for said affected gene set, 

1 0 (d) determining said basal expression profile for said affected gene set for 

a different cell from said contacted cell, 

wherein said basal expression profile for said affected gene set indicates a 
cell susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
1 5 chemical agent. 

In a preferred embodiment, said inhibition of growth is the death of said 
susceptible cell. In other such embodiments, said inhibition of growth is a 
cessation of multiplication of said susceptible cell or a decrease in the 
20 metabolism of such cell. 

In specific embodiments of any of the methods of the invention, the 
different cell is a cell of the same tissue type as said susceptible cell, preferably 
of the same species as said susceptible cell. In addition, said susceptible cell 
25 may be a cancer cell and/or said different cell is a cancer cell. In one 
embodiment, the susceptible cell is a recombinant cell, including a cell 
genetically engineered to express an affected gene set. In another embodiment, 
the different cell is a non-cancer cell. In preferred embodiments, the susceptible 
cell is a human cell and/or the different cell is a human cell. 
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In other embodiments of any of the methods of the invention, identification 
of the susceptible cell and test compound comprise retrieving such information 
from a database, such as a public database. In addition, the determining in step 
(c) of the above-recited method may comprise retrieving the basal expression 
5 profile of the affected set, as well as identification of the affected set where the 
mechanism of action of the test compound is known, from a database, such as a 
public database. In another such embodiment, the determining in step (d) 
comprises retrieving said basal expression profile from a database, such as a 
public database. In one such embodiment, the determining of steps (c) and (d) 
1 0 each comprises retrieving said basal expression profile from a database, such as 
a public database. 

In other embodiments of the above-recited method, the expression is 
transcription. In addition, the change in expression profile of step (b) may be 

1 5 determined by determining synthesis of RNA, including either amount of RNA 
produced, rate of production, or both. In another embodiment, the change in 
expression profile of step (b) is determined by determining polypeptide synthesis. 
In a further such embodiment, the change in expression profile of step (b) is 
determined by determining enzyme inhibitory activity. The identify an expression 

20 profile, such determining may be a combination of the foregoing, such as where 
transcription to produce RNA is determined, or known, for some genes and 
protein synthesis and/or activity is determined, or known, for others. In addition, it 
may be known for some genes of an affected gene set and determined for other 
genes of an affected gene set. 

25 

In another aspect, the present invention relates to a method for treating a 
disease in a mammal comprising administering to an mammal, preferably a 
human patient, afflicted with said disease of a therapeutically effective amount of 
the test compound, or an agent of the same class or mechanism of action of the 
30 test compound, wherein said disease is caused by a cell having the basal 
expression profile of an affected gene set in a cell known to be susceptible to 
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said test compound, or compounds of the same class, such as those having 
similar structure or general physiological function, or mechanism of action as the 
test compound. In a preferred embodiment, said disease is cancer and said test 
compound is an anti-neoplastic agent, such as a known anti-neoplastic agent. 
5 In an additional embodiment, the susceptible cell of (a) is a colon cell, 

such as a colon cancer cell. The cells utilized in the methods of the invention 
may also be recombinant cells engineered to express the determined genes, 
such as one or more genes of an affected gene set, including where the 
recombinant cell does not express the determined genes absent being 
1 0 engineered to do so, such as by genetic engineering. 

In one embodiment, the test compound of step (a) is not an agent 
possessing known biologically activity so that the methods of the invention find 
use in identifying novel agents with a selected biological activity. The invention is 
15 then useful in locating other cells and tissues having a similar gene-based 
susceptibility to the test compound. 

Thus, the present invention further relates to compounds identified as 
having biological activity by the methods of the invention. In preferred 
20 embodiments, such identified compounds have therapeutic activity, and/or anti- 
neoplastic activity, and/or enzyme inhibitory, as first determined by the methods 
disclosed herein but such activity is realized using cells or tissues whose 
susceptibility, or resistance, to the effects of the test compound were not 
theretofore appreciated. 

25 

The present invention also relates to a method for treating a disease 
comprising administering to an animal afflicted with said disease of a 
therapeutically effective amount of a compound identified by the methods of the 
invention as having therapeutic activity with a tissue different from one of known 
30 susceptibility. In a preferred embodiment, said therapeutic activity is anti- 
neoplastic activity. 
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Thus, the invention also encompasses cases where the agent, or test 
compound, may have been known to have a biological activity in one kind of cell 
but not others that can be tested using the methods herein. In addition, such 
5 known, or suspected, biological activity may have been previously determined to 
involve a different molecular mechanism than utilized by the methods of the 
present invention. 

In one embodiment, the affected gene set is a cancer related gene set, 
1 0 identified by the modulation of all of its member genes by a given anti-neoplastic 
agent. 

The methods of the present invention also find use in the stratification of 
patient populations into those likely to be responders (to a therapeutically active 
15 test compound) and those likely to be non-responders (i.e., resistant) to such 
agent. 

In accordance therewith, the present invention relates to a method of 
identifying a cancer patient likely to respond positively to treatment with a 
20 selected anti-neoplastic agent comprising evaluating said patient's cancer for the 
presence of a cell equivalent to a different cell as identified by the method of the 
invention wherein said anti-neoplastic agent is the test compound of the 
invention. 

25 Alternatively, the present invention relates to a method of identifying a 

cancer patient unlikely to respond positively to treatment with a selected anti- 
neoplastic agent comprising evaluating said patient's cancer for the absence of a 
cell equivalent to a different cell as identified by the method of the invention 
wherein said anti-neoplastic agent is the test compound of the invention. 
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Thus, the present invention permits clinicians to determine, prior to 
treatment with one of the different therapeutic agents currently available, the 
likelihood that a patient afflicted with a particular malady will respond positively to 
such treatment. By way of non-limiting example only, a patient afflicted with 
5 cancer may initially be considered as a candidate for any of a vast number of 
known anti-neoplastic agents currently approved for such use. However, as is 
appreciated by clinicians, not all anti-cancer drugs are equally useful against all 
cancers or in all patients, even those with the same type of cancer or where the 
same tissue or organ is cancerous. Thus, not all patients with colon cancer may 
10 respond equally well to a given anti-neoplastic agent while an agent useful 
against melanoma may not be useful against sarcoma or an agent useful 
against ovarian cancer may not be useful against colorectal cancer. In addition, 
valuable time may be wasted while the spectrum of anti-cancer drugs is tried in 
succession or where combination treatment proves ineffectual. 

15 

The present invention affords methods for determining the likely prognosis 
for using a selected anti-neoplastic agent prior to use. Thus, where a given agent 
is known to be effective against a given cancer cell, such as a specific type of 
cancer or cancer of a particular organ or tissue, or where the agent is determined 

20 to have such effect, and where said effect is due to a change in the expression of 
a selected gene, or expression profile of a selected set of genes, whether 
previously known or newly determined, and a basal expression profile can be 
identified for this cancer-related gene set (or affected gene set), it is then 
possible to scan databases and other types of information sources, especially 

25 where this can be done by computerized search, for other cells and tissues, 
especially cancerous cells and tissues, for example, cells drawn from different 
types of cancers, such as melanomas, carcinomas, sarcomas and the like, or for 
cancers of various organs, such as ovary, colon, stomach, and the like, for 
information on expression of the member genes of an affected gene set, 

30 especially where said expression matches the profile of the basal expression 
profile of an affected gene set modulated by an established anti-cancer drug, 
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such cancers then become likely targets of the same anti-cancer drug, or class of 
drugs, without the need to perform extended testing beforehand or wasting of 
time and money in trying drugs that may or may not work. 

5 In sum, where a selected anti-tumor agent works via change in expression 

profile of a given set of genes and the basal expression profile of this gene set in 
a susceptible cancer cell is the same as the basal expression profile in other 
cells, such as other cancer cells, it is deemed likely that the same drug will also 
operate against those tumors. 

10 

In this way, the methods of the invention provide a means of stratifying 
patients into groups based on whether their cancer is of a type that contains cells 
expressing a gene set with the same basal expression profile as that for the 
same, or corresponding, gene set of a cancerous cell known to be susceptible to 

1 5 a selected drug wherein the latter's mechanism of action comprises a change in 
the expression profile of said gene set. In this way, patients with cancers of 
different types, or cancers derived from different organs and tissues, can be 
assessed for positive prognosis with a given drug prior to start of any treatment 
regimen, thereby achieving a substantial increase in the likelihood of successful 

20 treatment. 

The methods of the invention also find use in facilitating research to 
develop drugs useful in the treatment of diseases such as cancer. 

25 Any compound that has an effect on cellular processes has a gene 

expression signature to describe those effects in a sensitive, selective way. 
Compounds that act via different mechanisms of action have distinct 
signatures at the gene expression level. Using gene transcription detection 
technologies, the present invention has been used to identify the genetic 
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signature of numerous available anticancer compounds, as well as novel 
compounds currently in lead optimization. Using the gene signatures specific 
for several selected anti-cancer test compounds, the methods of the 
invention were successful in determining the utility of each compound, or 
5 series of compounds, in particular types of cancer, identifying sensitive 
models for /n v/tro and /n v/vo evaluation, and building predictors for use in 
the clinical development of the test compounds. 

For example, in one instance the methods of the invention, employing 
10 the particular gene expression profile of a compound series in lead 
optimization, led away from optimization in colon cancer cells to evaluating 
these compounds in leukemia cell lines. 

In other cases, the use of gene expression signatures for a selected 
1 5 program in optimization helped identify a specific set of cell lines that were 
sensitive to the compounds being evaluated (see the results of Figure 1). In 
addition, other data suggests that the specific signature of a selected 
compound is the best tool to use for identifying patient populations during 
the clinical development of that compound. Thus, utilizing the methods of 
20 the present invention, such decisions are made early in drug discovery to 
avoid expensive and time consuming efforts with little value realized while 
also accelerating the drug discovery process and establishing a clearer path 
toward clinical development of such drugs. 

25 The gene expression profile of an affected gene set may be measured or 

already known. For measurement, expression is commonly assayed using RNA 
expression as an indicator. Thus, the greater the level of RNA (messenger RNA) 
detected the higher the level of expression of the corresponding gene. Thus, gene 
expression, either absolute or relative, such as here where the expression of 

30 several different genes is being quantitatively evaluated and compared in order to 
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establish the gene expression profile of a test compound, for example, the genes of 
a related gene set as disclosed herein, is determined by the relative expression of 
the RNAs encoded by the various gene members of the set. 



5 RNA may be isolated from samples in a variety of ways, including lysis and 

denaturation with a phenolic solution containing a chaotropic agent (e.g., triazol) 
followed by isopropanol precipitation, ethanol wash, and resuspension in aqueous 
solution; or lysis and denaturation followed by isolation on solid support, such as a 
Qiagen resin and reconstitution in aqueous solution; or lysis and denaturation in 
1 0 non-phenolic, aqueous solutions followed by enzymatic conversion of RNA to DNA 
template copies. 

Steady state RNA expression levels (i.e., basal expression) for the genes of 
an affected gene set may be known in the literature or may be determined by 

15 methods disclosed below. Such steady state levels of expression are easily 
determined by any methods that are sensitive, specific and accurate. Such 
methods include, but are in no way limited to, real time quantitative polymerase 
chain reaction (PCR), for example, using a Perkin-Elmer 7700 sequence detection 
system with gene specific primer probe combinations as designed using any of 

20 several commercially available software packages, such as Primer Express 
software., solid support based hybridization array technology using appropriate 
internal controls for quantitation, including filter, bead, or microchip based arrays, 
solid support based hybridization arrays using, for example, chemiluminescent, 
fluorescent, or electrochemical reaction based detection systems. 

25 

The present invention also relates to recombinant cells engineered to 
contain intrachromosomally or extrachromosomally one or more genes that 
together form a related gene set as described herein. Such recombinant cells are 
genetically engineered (transduced or transformed or transfected) with suitable 
30 vectors, which may be, for example, a cloning vector or an expression vector. The 
vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc. 
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The engineered host cells can be cultured in conventional nutrient media modified 
as appropriate for activating promoters, selecting transformants or amplifying the 
genes of the present invention. The culture conditions, such as temperature, pH 
and the like, are those previously used with the host cell selected for expression, 
5 and will be apparent to the ordinarily skilled artisan. 

The appropriate DNA sequence may be inserted into the vector by a variety 
of procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures known in the art. Such procedures 
1 0 and others are deemed to be within the scope of those skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. 
As representative examples of such promoters, there may be mentioned: LTR or 
SV40 promoter, the £ coli. lac or trp, the phage lambda P L promoter and other 
promoters known to control expression of genes in prokaryotic or eukaryotic cells or 
their viruses. The expression vector also contains a ribosome binding site for 
translation initiation and a transcription terminator. The vector may also include 
appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable 
marker genes to provide a phenotypic trait for selection of transformed host cells 
such as dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, 
or such as tetracycline or ampicillin resistance in E. coli. 

The vector containing the appropriate DNA sequence as hereinabove 
described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the protein. 

30 As representative examples of appropriate hosts, there may be mentioned: 

bacterial cells, such as E. coli, Streptomyces, Salmonella typhimurium; fungal cells, 
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such as yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells 
such as CHO, COS or Bowes melanoma; adenoviruses; plant cells, etc. The 
selection of an appropriate host is deemed to be within the scope of those skilled in 
the art from the teachings herein. 

5 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, 17, gpt, lambda P R) P L and trp. Eukaryotic 
10 promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein-l. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to host cells 
1 5 containing the above-described constructs, such as the genes forming a related 
gene set as defined herein. The host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can 
be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the 
host cell can be effected by calcium phosphate transfection, DEAE-Dextran 
20 mediated transfection, or electroporation. 

Common methods useful herein are those described in detail in Sambrook, 
et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor, N.Y., (1989), Wu et al, Methods in Gene Biotechnology (CRC Press, New 
25 York, NY, 1997), and Recombinant Gene Expression Protocols, in Methods in 
Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, NJ, 1997), the 
disclosures of which are hereby incorporated by reference. 

The present invention also relates to a process that comprises a method 
30 for producing a product, such as by generating test data to facilitate identification 
of such product, comprising identifying an agent according to one of the 
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disclosed processes for identifying such an agent (i.e., the therapeutic agents 
identified according to the assay procedures disclosed herein) wherein said 
product is the data collected with respect to said agent as a result of said 
identification process, or assay, and wherein said data is sufficient to convey the 
5 chemical character and/or structure and/or properties of said agent. For example, 
the present invention specifically contemplates a situation whereby a user of an 
assay of the invention may use the assay to screen for compounds having the 
desired enzyme modulating activity and, having identified the compound, then 
conveys that information (i.e., information as to structure, dosage, etc) to another 

1 0 user who then utilizes the information to reproduce the agent and administer it for 
therapeutic or research purposes according to the invention. For example, the 
user of the assay (user 1) may screen a number of test compounds without 
knowing the structure or identity of the compounds (such as where a number of 
code numbers are used the first user is simply given samples labeled with said 

1 5 code numbers) and, after performing the screening process, using one or more 
assay processes of the present invention, then imparts to a second user (user 2), 
verbally or in writing or some equivalent fashion, sufficient information to identify 
the compounds having a particular modulating activity (for example, the code 
number with the corresponding results). This transmission of information from 

20 user 1 to user 2 is specifically contemplated by the present invention. 

In accordance with the foregoing, the present invention relates to a 
method for producing test data with respect to the susceptibility of a cell to a 
selected chemical agent, comprising: 

25 (a) identifying a cell susceptible to a test compound wherein said 

susceptibility depends on a change in expression profile of a set of genes 
expressed by the susceptible cell and due to said test compound 

(b) identifying a different cell from said susceptible cell wherein said 
different cell expresses the set of gene of (a) with the expression profile of said 

30 set of genes of said susceptible cell absent the change due to said test 
compound, 
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(c) producing test data identifying said different cell as a cell susceptible to 
said selected chemical agent. 



WHAT IS CLAIMED IS: 



1 . A method for identifying a cell susceptible to a selected chemical agent, 
comprising: 

5 (a) identifying a cell susceptible to a test compound wherein said 

susceptibility depends on a change in expression profile of a set of genes 
expressed by the susceptible cell and due to said test compound 

(b) identifying a different cell from said susceptible cell wherein said 
different cell expresses the set of gene of (a) with the expression profile of said 
10 set of genes of said susceptible cell absent the change due to said test 
compound, 

wherein said expression profile identifies said different cell as being a cell 
susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
1 5 chemical agent. 

2. The method of claim 1 wherein said susceptibility is manifested by an 
inhibition of growth of said susceptible cell. 

20 3. The method of claim 2 wherein said inhibition of growth is the death of 

said susceptible cell. 

4. The method of claim 2 wherein said inhibition of growth is a cessation of 
multiplication of said susceptible cell. 

25 

5. The method of claim 1 wherein said different cell is a cell of the same 
tissue type as said susceptible cell. 

6. The method of claim 1 wherein said different cell is a cell of the same 
30 species as said susceptible cell. 
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7. The method of claim 1 wherein said susceptible cell is a cancer cell. 

8. The method of claim 1 wherein said different cell is a cancer cell. 

5 9. The method of claim 1 wherein said susceptible cell is a recombinant 

cell. 

10. The method of claim 1 wherein said different cell is a non-cancer cell. 
10 11. The method of claim 1 wherein said susceptible cell is a human cell. 

12. The method of claim 1 wherein said different cell is a human cell. 

13. A method for identifying a cell susceptible to a selected chemical 
1 5 agent, comprising: 

(a) contacting a test compound with a cell and determining inhibition of 
growth of said cell due to said contacting thereby identifying said cell as a 
susceptible cell, 

(b) determining a change in the expression profile of a set of genes 
20 expressed by said susceptible cell wherein said change is due to said contacting 

thereby identifying said set of genes as an affected gene set, 

(c) determining the expression profile of said affected gene set of said 
susceptible cell in the absence of said contacting and thereby identifying a basal 
expression profile for said affected gene set, 

25 (d) determining said basal expression profile for said affected gene set for 

a different cell from said contacted cell, 

wherein said basal expression profile for said affected gene set indicates a 
cell susceptible to said test compound 

thereby identifying said different cell as a cell susceptible to said selected 
30 chemical agent. 
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14. The method of claim 13 wherein said expression is transcription. 

15. The method of claim 13 wherein said change in expression profile of 
(b) is determined by determining synthesis of RNA. 

5 

16. The method of claim 13 wherein said change in expression profile of 
(b) is determined by determining polypeptide synthesis. 

17. The method of claim 13 wherein said change in expression profile of 
1 0 (b) is determined by determining enzyme inhibitory activity. 

18. The method of claim 13 wherein said determining in step (c) 
comprises retrieving said basal expression profile from a database. 

15 19. The method of claim 13 wherein said determining in step (d) 

comprises retrieving said basal expression profile from a database. 

20. The method of claim 13 wherein said determining of step (c) and (d) 
each comprises retrieving said basal expression profile from a database. 

21. A method for treating a disease in a mammal comprising administering 
to a mammal afflicted with said disease a therapeutically effective amount of the 
test compound of claim 1 wherein said disease is caused by a cell having the 
expression profile of step (b) of claim 1 . 

22. The method of claim 21 wherein said mammal is a human being. 

23. The method of claim 21 wherein said disease is cancer. 

30 24. A method for producing test data with respect to the susceptibility of a 

cell to a selected chemical agent, comprising: 
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(a) identifying a cell susceptible to a test compound wherein said 
susceptibility depends on a change in expression profile of a set of genes 
expressed by the susceptible cell and due to said test compound 

(b) identifying a different cell from said susceptible cell wherein said 
5 different cell expresses the set of gene of (a) with the expression profile of said 

set of genes of said susceptible cell absent the change due to said test 
compound, 

(c) producing test data identifying said different cell as a cell susceptible to 
said selected chemical agent. 

10 

25. The method of claim 24 wherein said susceptible cell is a recombinant 

cell. 

26. The method of claim 24 wherein said different cell is a human cell. 

15 

27. The method of claim 24 wherein said susceptible cell is a cancer cell. 

28. A recombinant cell expressing an affected gene set useful in the 
method of claim 1 . 

20 

29. A recombinant cell expressing an affected gene set useful in the 
method of claim 24. 

30. A method of identifying a cancer patient likely to respond positively to 
25 treatment with a selected anti-neoplastic agent comprising evaluating said 

patient's cancer for the presence of a cell equivalent to a different cell as 
identified by the method of claim 1 wherein said anti-neoplastic agent is the test 
compound of claim 1 . 

30 31. A method of identifying a cancer patient unlikely to respond positively 

to treatment with a selected anti-neoplastic agent comprising evaluating said 
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patient's cancer for the absence of a cell equivalent to a different cell as identified 
by the method of claim 1 wherein said anti-neoplastic agent is the test compound 
of claim 1 . 
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ABSTRACT 



Methods for the identification of disease-related gene sets and expression 
profiles, based on common chemical modulation using a specific chemical 
5 compound are disclosed. 



10 



15 



25 



33 



2 a 

50 C 

J " 

C w 

.a .9 

u > 



C8 S3 

I ; 

g OX) 

« "E 

«s ° 



o 

o > 
S " 



a. 2 



O « 

a c 

S - 
o s 

U 2 



I s 



Illi 

O a, 2 1 

>. a E i 



3UIBN IPD 



^ 2; 




